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We consider a flow-level model of a network operating under an 
a-fair bandwidth sharing policy (with a > 0) proposed by Roberts 
and Massoulie (2000). This is a probabilistic model that captures the 
long-term aspects of bandwidth sharing between users or flows in a 
communication network. 

We study the transient properties as well as the steady-state dis- 
tribution of the model. In particular, for a > 1, we obtain bounds 
on the maximum number of flows in the network over a given time 
horizon, by means of a maximal inequality derived from the standard 
Lyapunov drift condition. As a corollary, we establish the full state 
space collapse property for all a > 1. 

For the steady-state distribution, we obtain explicit exponential 
tail bounds on the number of flows, for any a > 0, by relying on a 
norm-like Lyapunov function. As a corollary, we establish the validity 
of the diffusion approximation developed by Kang et al (2009), in 
steady state, for the case where a — 1 and under a local traffic 
condtion. 

1. Introduction. We consider a flow-level model of a network that op- 
erates under an a-fair bandwidth-sharing policy, and establish a variety of 
new results on the resulting performance. These results include tail bounds 
on the size of a maximal excursion during a finite time interval, finiteness 
of expected queue sizes, exponential tail bounds under the steady-state dis- 
tribution, and the validity of the heavy-traffic diffusion approximation in 
steady state. We note that our results are to a great extent parallel and com- 
plementary to our work on packet-switched networks, which was reported 
in \m. 
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In the remainder of this section, we put our results in perspective by 
comparing them with earlier work, and conclude with some more details on 
the nature of our contributions. 

1.1. Background. The flow-level network model that we consider was 
introduced by Roberts and Massoulie [T7] to study the dynamic behavior of 
Internet flows. It builds on a static version of the model that was proposed 
earlier by Kelly, Maulloo, and Tan [15], and subsequently generalized by 
Mo and Walrand [16] who introduced a class of "fair" bandwidth-sharing 
policies parameterized by a > 0. 

The most basic question regarding flow-level models concerns necessary 
and sufficient conditions for stability, that is, for the existence of a steady- 
state distribution for the associated Markov process. This question was an- 
swered by Bonald and Massoulie [1] for the case of a-fair policies with a > 0, 
and by de Veciana et al. [6] for the case of max-min fair policies (a — > oo) 
and proportionally fair (a = 1) policies. In all cases, the stability conditions 
turned out to be the natural deterministic conditions based on mean arrival 
and service rates. 

Given these stability results, the natural next question is whether the 
steady-state expectation of the number of flows in the system is finite and 
if so, to identify some nontrivial upper bounds. When a > 1, the finiteness 
question can be answered in the affirmative, and explicit bounds can be 
obtained, by exploiting the same Lyapunov drift inequality that had been 
used in earlier work to establish stability. However, this approach does not 
seem to apply to the case where a £ (0, 1), which remained an open problem; 
this is one of the problems that we settle in this paper. 

A more refined analysis of the number of flows present in the system con- 
cerns exponentially decaying bounds on the tail of its steady-state distribu- 
tion. We provide results of this form, together with explicit bounds for the 
associated exponent. While a result of this type was not previously available, 
we take note of related recent results by Stolyar [21] and Venkataramanan 
and Lin [23] who provide a precise asymptotic characterization of the expo- 
nent of the tail probability, in steady state, for the case of switched networks 
(as opposed to flow- level network models). (To be precise, their results con- 
cern the (1 + a) norm of the vector of flow counts under maximum weight 
or pressure policies parameterized by a > 0.) We believe that their meth- 
ods extend to the model considered here, without much difficulty. However, 
their approach leads to a variational characterization that appears to be 
difficult to evaluate (or even bound) explicitly. We also take note of work by 
Subramanian [22], who establishes a large deviations principle for a class of 
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switched network models under maximum weight or pressure policies with 
a. = 1. 

The analysis of the steady-state distribution for underloaded networks 
provides only partial insights about the transient behavior of the associated 
Markov process. As an alternative, the heavy-traffic (or diffusion) scaling 
of the network can lead to parsimonious approximations for the transient 
behavior. A general two-stage program for developing such diffusion approx- 
imations has been put forth by Bramson [5] and Williams [23], and has been 
carried out in detail for certain particular classes of queueing network mod- 
els. To carry out this program, one needs to: (i) provide a detailed analysis of 
a related fluid model when the network is critically loaded; and, (ii) identify 
a unique distributional limit of the associated diffusion-scaled processes by 
studying a related Skorohod problem. The first stage of the program was 
carried out by Kelly and Williams [14] who identified the invariant manifold 
of the associated critically loaded fluid model. This further led to the proof 
by Kang et al. [13] of a multiplicative state space collapse property, simi- 
lar to results by Bramson [5]. We note that the above summarized results 
hold under a-fair policies with an arbitrary a > 0. The second stage of the 
program has been carried out for the proportionally fair policy (a = 1) by 
Kang et al. [13] . under a technical local traffic condition, and more recently, 
by Ye and Yao [25], under a somewhat less restrictive technical condition. 
We note however that when a/l,a diffusion approximation has not been 
established. In this case, it is of interest to see at least whether properties 
that are stronger than multiplicative state space collapse can be derived, 
something that is accomplished in the present paper. 

The above outlined diffusion approximation results involve rigorous state- 
ments on the finite-time behavior of the original process. Kang et al. [13] 
further established that for the particular setting that they consider, the 
resulting diffusion approximation has an elegant product-form steady-state 
distribution; this result gives rise to an intuitively appealing interpretation 
of the relation between the congestion control protocol utilized by the flows 
(the end-users) and the queues formed inside the network. It is natural to 
expect that this product-form steady-state distribution is the limit of the 
steady-state distributions in the original model under the diffusion scaling. 
Results of this type are known for certain queueing systems such as gen- 
eralized Jackson networks; see the work by Gamarnik and Zeevi [10] , On 
the other hand, the validity of such a steady-state diffusion approximation 
was not known for the model considered in [13] : it will be established in the 
present paper. 



4 



SHAH & TSITSIKLIS & ZHONG 



1.2. Our contributions. In this paper, we advance the performance anal- 
ysis of flow-level models of networks operating under an a-fair policy, in 
both the steady-state and the transient regimes. 

For the transient regime, we obtain a probabilistic bound on the maximal 
(over a finite time horizon) number of flows, when operating under an a- 
fair policy with a > 1. This result is obtained by combining a Lyapunov 
drift inequality with a natural extension of Doob's maximal inequality for 
non-negative supermartingales. Our probabilistic bound, together with prior 
results on multiplicative state space collapse, leads immediately to a stronger 
property, namely, full state space collapse, for the case where a > 1. 

For the steady-state regime, we obtain non-asymptotic and explicit bounds 
on the tail of the distribution of the number of flows, for any a > 0. In the 
process, we establish that, for any a > 0, all moments of the steady-state 
number of flows are finite. These results are proved by working with a normed 
version of the Lyapunov function that was used in prior work. Specifically, 
we establish that this normed version is also a Lyapunov function for the 
system (i.e., it satisfies a drift inequality). It also happens to be a Lipschitz 
continuous function and this helps crucially in establishing exponential tail 
bounds, using results of Hajek [12] and Bertsimas et al. [2]. 

The exponent in the exponential tail bound that we establish for the 
distribution of the number of flows is proportional to a suitably defined 
distance ("gap") from critical loading; this gap is of the same type as the 
familiar 1 — p term, where p is the usual load factor in a queueing system. 
This particular dependence on the load leads to the tightness of the steady- 
state distributions of the model under diffusion scaling. It leads to one of our 
main results, namely, the validity of the diffusion approximation, in steady 
state, when a = 1 and a local traffic condition holds. 

1.3. Organization. The rest of the paper is organized as follows. In Sec- 
tion 2, we define the notation and some of the terminology that we will em- 
ploy. We also describe the flow-level network model, as well as the weighted 
a-fair bandwidth-sharing policies. In Section 3, we provide formal statements 
of our main results. The transient analysis is presented in Section 4. We start 
with a general lemma, and specialize it to obtain a maximal inequality under 
a-fair policies, when a > 1. We then apply the latter inequality to prove full 
state space collapse when a > 1. 

We then proceed to the steady-state analysis. In Section 5, we establish 
a drift inequality for a suitable Lyapunov function, which is central to our 
proof of exponential upper bounds on tail probabilities We prove the ex- 
ponential upper bound on tail probabilities in Section 6. The validity of 
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the heavy-traffic steady-state approximation is established in Section 7. We 
conclude the paper with a brief discussion in Section 8. 

2. Model and Notation. 

2.1. Notation. We introduce here the notation that will be employed 
throughout the paper. We denote the real vector space of dimension M by 
M. M , the set of nonnegative M-tuples by R5r, and the set of positive M- 
tuples by M*f . We write R for M 1 , M+ for R\, and R p for H*. We let Z 
be the set of integers, Z + the set of nonnegative integers, and N the set of 
positive integers. Throughout the paper, we reserve bold letters for vectors 
and plain letters for scalars. 

For any vector x € M. M , and any a > 0, we define 




and we define = max jg r 1 m\ \ x i\- For any two vectors x = (xi)f£ 1 

and y = {yi)fL 1 of the same dimensions, we let (x, y) = Yli=i x iVi ^ e the 
inner product of x and y. We let be the i-th. unit vector in R M , and 1 
the vector of all ones. For a set S, we denote its cardinality by \S\, and its 
indicator function by Is- For a matrix A, we let A T denote its transpose. 

2.2. Flow-Level Network Model. 

The Model. We adopt the model and notation in jT3]. As explained in 
detail in [14] , this model faithfully captures the long-term (or macro level) 
behavior of congestion control in the current Internet. 

Let time be continuous and indexed by t € R+. Consider a network with 
a finite set J of resources and a set X of routes, where a route is identified 
with a non-empty subset of the resource set J. Let A be the \ J\ x \X\ 
matrix with Aji = 1 if resource j is used by route i, and Aji = otherwise. 
Assume that A has rank |J7"|. Let C = (Cj)j^j be a capacity vector, where 
we assume that each entry Cj is a given positive constant. Let the number 
of flows on route i at time t be denoted by Ni(t), and define the flow vector 
at time t by N(t) = (A r j(t))j g j. For each route i, new flows arrive as an 
independent Poisson process of rate v\. Each arriving flow brings an amount 
of work (data that it wishes to transfer) which is an exponentially distributed 
random variable with mean 1/fii, independent of everything else. Each flow 
gets service from the network according to a bandwidth-sharing policy Once 
a flow is served, it departs the network. 
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The a-Fair Bandwidth- Sharing Policy. A bandwidth sharing policy has to 
allocate rates to flows so that capacity constraints are satisfied at each time 
instance. Here we discuss the popular a-fair bandwidth-sharing policy, where 
a > 0. At any time, the bandwidth allocation depends on the current number 
of flows n = (nj)jgj. Let Aj be the total bandwidth allocated to route i under 
the a-fair policy: each flow of type i gets rate Aj/raj if rii > 0, and Aj = if 
Hi = 0. Under an a-fair policy, the bandwidth vector A(n) = (Aj(n))j e i is 
determined as follows. 

If n = 0, then A = 0. If n ^ 0, then let X+(n) = {i G 1 : m > 0}. For 
i X+(n), set Aj(n) = 0. Let A + (n) = (Aj(n))j gZ+ ( n ). Then, A + (n) is the 
unique maximizer in the optimization problem 



(1) 
(2) 



Ixi 

maximize G n (A + ) over A € W + 
subject to ^ AjiAi <Cj, V j € J, 

ieX+ (n) 



where 



G„(A_ 



E 



K{n. 



1-a 



1 — a ' 

iSX + (n) 

^ KjnjlogAj, if a = 1. 

I, ieX + (n) 



if a € (0,oo)\{l}, 



Here, for each i € I, Ki is a positive weight assigned to route i. 

Some crucial properties of A(n) are as follows (see Appendix A of 

(i) Aj(n) > for every i € Z + (n); 

(ii) A(rn) = A(n) for r > 0; 

(hi) For every n and every i € X + (n), the function Aj(-) is continuous at 
n. 

Flow Dynamics. The flow dynamics are described by the evolution of the 
flow vector N(t) = (Ni(t))i^x, a Markov process with infinitesimal transition 
rate matrix q given by 



(3) 



q(n, n + m) 




if m = ej, 
if m = — ej 
otherwise, 



and rii > 1, 



where for each i, Vi > and Ui > are the arrival and service rates defined 
earlier, and ej is the i-th unit vector. 
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Capacity Region. Flows of type i bring to the system an average of pi = 
Vij Hi units of work per unit time. Therefore, in order for the Markov process 
N(-) to be positive recurrent, it is necessary that 



We note that under the a-fair bandwidth-sharing policy, Condition ^ is 
also sufficient for positive recurrence of the process N(-) |H l6j H3]. 

2.3. A note on our use of constants. Our results and proofs involve var- 
ious constants; some are absolute constants, some depend only on the struc- 
ture of the network, and some depend (smoothly) on the traffic parameters 
(the arrival and service rates). It is convenient to distinguish between the 
different types of constants, and we define here the terminology that we will 
be using. 

The term absolute constant will be used to refer to a quantity that does 
not depend on any of the model parameters. The term network- dependent 
constant will be used to refer to quantities that are completely determined 
by the structure of the underlying network and policy, namely, the inci- 
dence matrix A, the capacity vector C, the weight vector k, and the policy 
parameter a. 

Our analysis also involves certain quantities that depend on the traf- 
fic parameters, namely, the arrival and service parameters /x and v. These 
quantities are often given by complicated expressions that would be incon- 
venient to carry through the various arguments. It turns out that the only 
property of such quantities that is relevant to our purposes is the fact they 
change continuously as /x and v vary over the open positive orthant. (This 
still allows these quantities to be undefined or discontinuous on the bound- 
ary of the positive orthant.) We abstract this property by introducing, in the 
definition that follows, the concept of a (positive) load- dependent constant. 

Definition 2.1. Consider a family of bandwidth- sharing networks with 
common parameters (A,C,k,q), but varying traffic parameters (p,,v). A 
quantity K will be called a (positive) load-dependent constant if for networks 
in that family it is determined by a relation of the form K = /(/x, v), where 



A key property of a load-dependent constant, which will be used in some 
of the subsequent proofs, is that it is by definition positive and further- 
more (because of continuity), bounded above and below by positive network- 
dependent constants if we restrict /x and v to a compact subset of the open 



(4) 



Ap < C, componentwise. 




continuous function on the open positive orthant 



R l p l x R l p l . 
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positive orthant. A natural example of a load-dependent constant is the load 
factor pi = Vij (Note that this quantity diverges as fa — s> 0.) 

We also define the gap of a underloaded bandwidth-sharing network. 

Definition 2.2. Consider a family of bandwidth- sharing networks with 
common parameters (A, C, K, a) and with varying traffic parameters (p, v) 
that satisfy Ap < C. The gap of a network with traffic parameters (p, u) 
in the family, denoted by e(p), is defined by 

e(p) = sup{e > : (1 + e)Ap < C}. 

We sometimes write e for e(p) when there is no ambiguity. Note also that 
e(p) plays the same role as the term 1 — p in a queueing system with load p. 

2.4. Uniformization. Uniformization is a well-known device which allows 
us to study a continuous-time Markov process by considering an associated 
discrete-time Markov chain with the same stationary distribution. We pro- 
vide here some details and the notation that we will be using. 

Recall that the Markov process N(-) of interest has dynamics given by 
([3]). Let S(n) = ^n<7( n iri) be the aggregate transition rate at state n. The 
embedded jump chain of N(-) is a discrete-time Markov chain with the same 

m 

state space and with transition probability matrix P given by 

q{n,h) 

p(n ' n) = w 

The so-called uniformized Markov chain is an alternative, more convenient, 
discrete-time Markov chain, denoted (N(r) I , to be defined shortly. 

V / tGZ+ 

We first introduce some more notation. Consider the aggregate transition 
rates H(n) = ^gfn, n). Since every route uses at least one resource, we 
have Aj(n) < max^gj- Cj, for all i El. Then, by ([3]), we have 

s ( n ) = ^2 q ( n > ") - ^ ( Ui + W A *( n )) <^2(^i + Mi m ax Cj ) • 

We define S = X^igj ( u i + M« max jg.7 Cj), and modify the rates of self- 
transitions (which were zero in the original model) to 

(5) g(n,n) := H-H(n). 

Note that S is a positive load-dependent constant. We define a transition 
probability matrix P by 

~ A g(n,n) 
P(n, n) = — — — . 



9 



Definition 2.3. The uniformized Markov chain ^N(r)J associated 

with the Markov process N(-) is a discrete-time Markov chain with the same 
state space Z\, and with transition matrix P defined as above. 

As remarked earlier, the Markov process N(-) that describes a bandwidth- 
sharing network operating under an a-fair policy is positive recurrent, as 
long as the system is underloaded, i.e., if Ap < C. It is not hard to ver- 
ify that N(-) is also irreducible. Therefore, the Markov process N(-) has 
a unique stationary distribution. The chain N(-) is also positive recurrent 
and irreducible, because N(-) is, and by suitably increasing S if necessary, 
it can be made aperiodic. Thus N(-) has a unique stationary distribution 
as well. A crucial property of the uniformized chain N(-) is that this unique 
stationary distribution is the same as that of the original Markov process 
N(-); see, e.g., [9]. 

2.5. A Mean Value Theorem. We will be making extensive use of a 
second-order mean value theorem [T], which we state below for easy ref- 
erence. 

Proposition 2.4. Let g : R M — > R be twice continuously differ entiable 
over an open sphere S centered at a vector x. Then, for any y such that 
x + y 6 S, there exists 6 € [0, 1] such that 

(6) </(x + y) = ff(x) + y T V 5 (x) + ^y T # (x + 0y)y, 

6 M M is the gradient of g at x, and H(x.) = 
£ ]^ MxM i s the Hessian of the function g at x. 

3. Summary of Results. In this section, we summarize our main re- 
sults for both the transient and the steady-state regime. The proofs are given 
in subsequent sections. 

3.1. Transient Regime. Here we provide a simple inequality on the max- 
imal excursion of the number of flows over a finite time interval, under an 
a-fair policy with a > 1. 

Theorem 3.1. Consider a bandwidth-sharing network operating under 
an a-fair policy with a > 1, and assume that Ap < C. Suppose that N(0) = 



where Vg(x) 



ag(x) 

dxi 



10 



SHAH & TSITSIKLIS & ZHONG 



0. Let N*(T) = swPte[o,T],iei Ni(t) , and lets be the gap. Then, for any b > 0, 

for some positive load- dependent constant K . 

As an important application, in Section 14.31 we will use Theorem 13.11 
to prove a full state space collapse result, when a > 1. (As discussed in 
the introduction, this property is stronger than multiplicative state space 
collapse.) The precise statement can be found in Theorem 14. 101 

3.2. Stationary Regime. As noted earlier, the Markov process N(-) has 
a unique stationary distribution, which we will denote by 7r. We use and 
P,r to denote expectations and probabilities under 7v. 

Exponential Bound on Tail Probabilities. For an a-fair policy, and for any 
a G (0, oo), we obtain an explicit exponential upper bound on the tail proba- 
bilities for the number of flows, in steady state. This will be used to establish 
an "interchange of limits" result in Section [7J See Theorem 17.61 for more de- 
tails. 

Theorem 3.2. Consider a bandwidth- sharing network operating under 
an a-fair policy with a > 0, and assume that Ap < C. Let e be the gap. 
There exist positive constants B, K , and £ such that for all £ G Z + : 

(8) IP-n- (||N|U > B + 2#) < (t^) + • 

Here£ and K are load- dependent constants, and B takes the form K' je when 
a > 1, and K' / minje 1 /", e} when a G (0, 1), with K' being a positive load- 
dependent constant. In particular, all moments of ||N||oo o,re finite under the 
stationary distribution iv, i.e., E^fllN^] < oo for every k G N. 

Here we note that Theorem 13.21 implies the following. The system load 
L(p), defined by L(p) = 1+£ / p j , satisfies L(p) w 1 — e(p) when e = e(p) 
is small, i.e., when the system approaches criticality. Then, an immediate 
consequence of the bound flSJ) is that 

limsup-logP^dlNlU > 7 ) < 

7— >oo 7 



— log 

2£ S 






Ke r 




? 
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Note that ^ is a load-dependent constant. Thus Theorem 13.21 shows that 
the large-deviations exponent of the steady-state number of flows is upper 
bounded by —(1 — L(p)), up to a multiplicative constant. 

Interchange of Limits (a = 1). As discussed in the introduction, when 
a = 1, Theorem 13.21 leads to the tightness (Lemma IT.Tf) of the steady-state 
distributions of the model under diffusion scaling. This in turn leads to The- 
orem EH and Corollary 17. 101 ° n the validity of the diffusion approximation in 
steady state. As the statements of these results require a significant amount 
of preliminary notation and background (which is introduced in Section [7]) , 
we give here an informal statement. 

Interchange of Limits theorem (informal statement): Consider a 
sequence of flow-level networks operating under the proportionally fair policy. 
Let N r (-) be the flow-vector Markov process associated with the rth network, 
let e r be the corresponding gap, and let tt 1 be the stationary distribution of 
e r N r (-). As E r —7- 0, and under certain technical conditions, tv t converges 
weakly to the stationary distribution of an associated limiting process. 

4. Transient Analysis (a > 1). In this section, we present a transient 
analysis of the a-fair policies with a > 1. First we present a general maximal 
lemma, which we then specialize to our model. In particular, we prove a 
refined drift inequality for the Lyapunov function given by 

(9) Ft>( „ ) = _L_g wr .(^* +1 . 

This Lyapunov function and associated drift inequalities have played an 
important role in establishing positive recurrence (cf. [1], [6], p3]) and mul- 
tiplicative state space collapse (cf. [13]) for a-fair policies. We combine our 
drift inequality with the maximal lemma to obtain a maximal inequality 
for bandwidth-sharing networks. We then apply the maximal inequality to 
prove full state space collapse when a > 1. 

4.1. The Key Lemma. Our analysis relies on the following lemma. 

Lemma 4.1. Let (J^n)nez + be a filtration on a probability space. Let 
(X n ) n£ z + be a nonnegative & n -adapted stochastic process that satisfies 

(10) E[X n+1 | & n ] < X n + B n 

where the B n are nonnegative random variables (not necessarily ^ n -adapted) 
with finite means. Let X* = maxjXo, . . . , X n } and suppose that Xq = 0. 
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Then, for any a > and any T G Z + , 

a 

This lemma is a simple consequence of the following standard maximal 
inequality for nonnegative supermartingales (see for example, Exercise 4, 
Section 12.4, of [II]). 

Theorem 4.2. Let (^ n )nez + ^ e a filtration on a probability space. Let 
(Y n ) n€ z + be a nonnegative & n - adapted supermartingale, i.e., for all n, 

e[y u+1 | jg < Y n . 

Let Y£ = max{Yo, . . . , It}. Then, 

F(Yf > a) < m. 

Proof of Lemma 14. 1[ First note that if we take the conditional expec- 
tation of both sides of (I10p , given & n , we have 

E[X n+1 | jy < E[X n I & n ] + E[B n | & n ] = X n + E[B n \ jg. 

Fix T G Z+. For any n <T, define 



Y n = X n + E 



'T-l 



.k=n 



<¥ 



Then 



E[Y n+1 | = E[X n+1 | + E : 

< X n + E[B n | jg + E 



T-l 



E * 

=n+l 
T-l 

E ^ 



^ r. 



.k=n+l 



Thus, Y n is an ^ n -adapted supermartingale; furthermore, by definition, Y n 
is nonnegative for all n. Therefore, by Theorem 14.21 



E\Y n 



E 



n B k 
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But Y n > X n for all n, since the Bk are nonnegative. Thus, 



Ejfc=0 B k 



E 

H X T >a)< P(lr > a) < — 

□ 

Since we are dealing with continuous-time Markov processes, the following 
corollary of Lemma 14.11 will be useful for our analysis. 

Corollary 4.3. Let (^t)t>o be a filtration on a probability space. Let 
Zt be a nonnegative, right- continuous J^t-adapted stochastic process that sat- 
isfies 

E[Z s+t \^ s ] < Z s + Bt, 

for all s,t > 0, where B is a nonnegative constant. Assume that Zq = 0. 
Denote Z^ = sup 0<t<T Z t (which can possibly be infinite). Then, for any 
a > 0, and for any T > 0, 

BT 

F(Zt >a)< . 

a 

Proof. The proof is fairly standard. We fix T > and a > 0. Since Zt 
is right-continuous, Z^ = sup tg r 0) r] Z t = sup tg ([ ,T]nQ)u{T} Z t . Consider an 
increasing sequence of finite sets I n so that U^ =l I n = ([0, T] n Q) U {T}, 

and 0,T 6 I n for all n. Define Zj^ = sup tg j Zt- Then (zfp ) is a non- 

V / n=l 

(n) 

decreasing sequence, and Zj, — > Zj^ as n — > oo, almost surely. For each 
z!p , we can apply Lemma l4.lt an d it is immediate that for any b > 0, 

(ii) p(4»> > b ) < 

since each /„, includes both and T. Since Zjf* increases monotonically to 
Z?p, almost surely, we have that F(Z^ > b) < P(Z^ +1 ^ > b) for all n, and 
P(Z^ n) > b) -> P(Z£ > 6) as ra -> oo. The right-hand side of CLI]) is fixed, so 

RT 

P(^T >&)<—• 

We now take an increasing sequence b n with limn^oo 6 n = a, and obtain 

F(Zt >a)< . 

a 

□ 
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4.2. A Maximal Inequality for Bandwidth-Sharing Networks. We employ 
the Lyapunov function Q to study a-fair policies. This is the Lyapunov 
function that was used in [4], [6] and [14] to establish positive recurrence of 
the process N(-) under an a-fair policy. Below we fine-tune the proof in [6] 
to obtain a more precise bound on the Lyapunov drift. We note that a "fluid- 
model" version of the following lemma appeared in the proof of Theorem 
1 in [4]. For notational convenience, we drop the subscript a from F a and 
write F instead. 

LEMMA 4.4. Consider a bandwidth- sharing network with Ap < C op- 
erating under an a-fair policy with a > 0. Let e be the gap. Then, for any 
non-zero flow vector n, 

(V.F(n), v - pA(n)> < -e (VF(n), v) , 

where (-, •) denotes the standard inner product, VF(n) denotes the gradient 
of F, and /xA(n) is the vector (/ijAj(n)) ig j. 

Proof. We have 



1 / n \ 

(VF(n), v - AtA(n)) = V —m f — ) - ^A 4 (n)) 

= <VG n (p + ),p + -A + (n)>, 

where p + = (pi) ie x + (n)- Similarly we can get (VF(n), v) = (VG„(p + ), p + ). 
Now consider the function g : [0, 1] — >• R defined by 

g{9) = G n {e(l+e) P+ + (1 - 0)A+(n)). 

Since (1 + e)p + satisfies the constraints in ([2]), and A + (n) maximizes the 
strictly concave function G n subject to the constraints in (J2j) , we have 

G n ((l + e) P+ ) < G n (A+(n)), i.e., g(l) < g(0). 
Furthermore, since G n is a concave function, g is also concave in 6. Thus, 

3(0) < g(l) + (0 - l)g'{\) < g(Q) + (0 - l)g'(l). 
Hence, g'(l) < 0, i.e., 

(12) %\ 9=1 = < VGn((1 + £)p + } ' (1 + £)P + " A+(n) > " °- 
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But it is easy to check that VG n ((l+e)p + ) = (1+e) Q VG n (p + ), so dividing 
(fT2|) by (1 + e)" a , we have 



(VG n (p + ),p + -A + (n)> <-e(VG n (p + ),p + ). 
This is the same as 

(VF(n), v - /xA(n)) < -e (VF(n), i/) . 

□ 

Our next lemma provides a uniform upper bound on the expected change 
of -F(N(-)) in one time step, where N(-) is the uniformized chain associated 
with the Markov process N(-) (cf. Definition 12.31) . 

Lemma 4.5. Let a > 1. As above, consider a bandwidth- sharing net- 
work with Ap < C operating under an a-fair policy. Let e be the gap. Let 
^N(t)^ be the uniformized chain associated with the Markov process 

N(-). Then, there exists a positive load- dependent constant K, such that for 
all t € Z + , 



E 



F(N(r + 1)) - F(n) | N(t) = n 



< Ke l ~ a . 



Proof. By the mean value theorem (cf. Proposition I2.4p . for n, m € ', 
we have 

(13) F(n + m) - F(n) = (VF(n), m) + ^m T V 2 F(n + 6m)m, 
for some 6 £ [0, 1]. We note that, for m = ±ej, we have 

(14) < ^(n l + ir-\ 

since a > 1, and 6 6 [0, 1]. 
As in [6], we define 

QF(n) = 9( n ' n + m )[ F ( n + m) - F(n)], 
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so that Q is the generator of the Markov process N(-). We now proceed 
to derive an upper bound for Qi ? (n). Using Equation (|13p . we can rewrite 
Q-F(n) as 

QF(n) = ^g(n,n + m) (VF(n), m) + -m T V 2 F( n + # m m)m 

m L 

= q(n, n + m) (V-F(n), m) 

m 

+— ^2 9( n i n + m)m T V 2 F(n + # m m)m, 

m 

for some scalars 6 m € [0, 1], one such scalar for each m. From the definition 
of q, we have 

q(n, n + m) (VF(n), m) = ( VF(n),^g(n,n + m)ra ) 

m \ m / 

= (VF(n),i/-/*A(n)). 
From (|14p . for m = ±ej, we also have 

^m T V 2 F(n + # m m)m < Kia{m + l) a ~ l /2p lP f. 

Thus, 

KiOL 



QF(n) < (VF(n), I ,-M(n))+^-^(n J + ir- 1 (z, i + M J (n)) 

iez VPi/ iex Pi 

< _ me ^ n « + M ^(n, + ir- 1 , 

where the second inequality follows from Lemma [4.44 and the third by defin 
ing 

A ■ 1— a n /r A K i a ( n \ 

m = mmKiPi , M = max — — I pi + maxCj I , 



and noting the fact that since Aj(n) < maxj^j Cj for all i, we have M > 
maxjgx fy£(pi + Aj(n)). It is then a simple calculation to see that for every 
n > 0, we have 

Q-F(n) < -me < + M ^(n* + l)"" 1 < ft 1 "", 
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for some positive load-dependent constant K. Now given N(r) = n, 

-l-Q 



E 



F(N(r + 1)) - F(n) | N(r) = n 



Q-F(n) AV 



By setting K = K/3, we have proved the lemma. 



□ 



Corollary 4.6. Let a > 1. As before, suppose that Ap < C, and let e 
be the associated gap. Then, under the a- fair policy, the process N(-) satisfies 

E [F(N(s + t)) - F(N(s)) | N(s)] < Kte^ a , for all t > 0. 

/or some positive load- dependent constant K . 

Proof. The idea of the proof is to show that the expected number of 
state transitions of N(-) in the time interval [s, s + 1] is of order 0(t). 

Consider the uniformized Markov chain N(-) associated with the process 
N(-). Denote the number of state transitions in the uniformized version of 
the process N(-) in the time interval [s, s + t] by r. By the Markov property, 
time-homogeneity, and the definition of N(-), we have 

E [F(N(s + t)) - F(N(s)) | N(s) = n] 



E 



F(N(t)) - F(N(0)) I N(0) = n 



Now, by the definition of the uniformized chain, r and N(-) are independent. 
Thus, 



E 



E 



E 



E 



< E 



E [.F(N(t)) - F(N(0)) | N(0) 

r-1 

J2(F(N(k + l))-F(N(k)) 



k=0 



E 



r-1 



^2[F(N(k + l))-F(N(k)) 

k=0 

^2 E \F(N(k + 1)) - F(N(k)) 



,k=0 
r-1 



N(0) 

N(0),r 
N(0),rl N(0) 









N(0) 



^2 E \F(N{k + 1)) - F(N(k)) | N(0) N(0) 



fc=0 
r-1 



_fc=0 
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for some load-dependent constant K. The fourth equality follows from the 
independence of r and N(-), and the inequality follows from Lemma 14.51 
Since the counting process of the number of state transitions in the uni- 
formized version of the process N(-) is a time-homogeneous Poisson process 
of rate H, we have E[r] = St. This shows that 

E [F(N(s + t)) - F(N(s)) | N(s)] < KEte l ~ a . 

The proof is concluded by setting K = KS. □ 

Proof of Theorem \3.1[ Let 6 > 0. Then 

¥(N*(T)>b) = F^-^(N*(T)) a+1 > ^y& q+1 ) 

< P| sup F(N(i)) > fmin — ^—Kitf*~ 1 v^ a \b ci+1 \ 
\te[o,T] \ iex a + 1 J J 



a + DK'T KT 



mm^^r^rV- 1 ^ 1 eQ " 16Q+1 ' 



< 



where the second inequality follows from Corollary 14.31 and Corollary 14.61 
K' is as in Corollary 14.61 and K = — — _ a . □ 

mm i6X n l pu l v. 

4.3. Full State Space Collapse for a > 1. Throughout this section, we 
assume that we have fixed a > 1, and correspondingly, the Lyapunov func- 
tion ([9]) . To state the full state space collapse result for a > 1 , we need some 
preliminary definitions and the statement of the multiplicative state space 
collapse result. 

Consider a sequence of bandwidth-sharing networks indexed by r, where 
r is to be thought of as increasing to infinity along a sequence. Suppose that 
the incidence matrix A, the capacity vector C and the weights {Kj : i € 1} do 
not vary with r. Write N r (i) for the flow- vector Markov process associated 
with the rth network. Similarly, we write v r , fi r , p r , etc. We assume the 
following heavy-traffic condition (cf. [13"]): 



Assumption 4.7. We assume that Ap r < C for all r. We also assume 
that there exist v, /x £ W + and 6 > 0, smc/i i/iot > and /Xj > /or all 
i EX, v r —> v and /j, r —¥ /j, as r — > oo, and r(C — Ap r ) — > 9 as r —¥ oo. 



Note that our assumption differs from that in [13], which allows conver- 
gence to the critical load from both overload and underload, whereas here 
we only allow convergence to the critical load from underload. 



19 



To state the multiplicative state space collapse result, we also need to 
define a workload process W(i) and a lifting map A. 

IXI I 71 

Definition 4.8. We first define the workload w : — > R^: 1 associ- 
ated with a flow-vector n by w = w(n) = AM ^n, where M = diag(/^) is 
the \I\ x \I\ diagonal matrix with fj, on its diagonal. The workload process 

W(t) is defined to be W(t) = AM _1 N(i), for all t>0. We also define the 

I 71 

lifting map A. For each w £ W + , define A(w) to be the unique value of 
m 

n £ W, that solves the following optimization problem: 
minimize F(n) 

subject to Yji & i A ji% > w j> j^J, 
rii > 0, i € X. 

For simplicity, suppose that all networks start with zero flows. We consider 
the following diffusion scaling: 

(15 ) N-( ( ) = and W-( t ) = ™, 

where W r (t) = A(M r )- 1 N r (t), and M r = diag(/i r ). 

The following multiplicative state space collapse result is known to hold. 

Theorem 4.9 (Multiplicative State Space Collapse (EH Theorem 5.1]). 
Fix T > and assume that a > 1. Write ||x(-)|| = sup ie j t T],iei \ x i(t)\- 
Then, under Assumption\4-7\ and for any 5 > 0, 



/ N r • - A W - A 

lim P i! — i ^ > 5 =0. 

r^oo ^ ||N r (-)|| J 

We can now state and prove a full state space collapse result: 

Theorem 4.10 (Full State Space Collapse). Under the same assump- 
tions as in Theorem\4-9[ and for any 5 > 0, 



T— >OQ 



lim P ||N r (-) - A(W r (-))|| > 5 =0. 



Proof. Let e r = e{p r ) be the gap in the rth system. Then, under As- 
sumption 33 £r > D/r for some network-dependent constant D > 0, and 
for r sufficiently large. By Theorem 13. 1\ for any 6 > 0, and for sufficiently 
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large r, 

2 T) >b) < 

< 



K r r 2 T 

K r r 1+a T 
Da -i ha +i- 



Here, K r is a load-dependent constant associated with the rth system, as 
specified in the proof of Theorem 13.11 From the proof of Theorem 13-H note 

also that K r = f(fj, r ,v r ), for a function / that is continuous on the open 

IXI XI 

positive orthant M p x W p . Since p r — > p > 0, and u r — > v > 0, K r — > 
K = /(/x, u) € M. In particular, the K r are bounded, and for all sufficiently 
large r, 

P (N r >*(r 2 T) >b)< {K+1 ^ 1+ " T . 
Then, with a = b/r and under the scaling in (|15D . 

K + 1 T 



(16) P(|PNT(-)||>a)< 

for any a > 0. 

For notational convenience, we write 

S(r) = ||N r (.)-A(W(-))||. 
Then, for any a > 1, and for sufficiently large r, 

iB(r)>5) < P| ^ (r) > - or ||N r (-)|| > a 




^ P ^7^>- + P (H Nr (-)ll^ a )- 



Note that by Theorem 14.91 the first term on the right-hand side goes to as 
r — > oo, for any a > 0. The second term on the right-hand side can be made 
smaller than any, arbitrarily small, constant (uniformly, for all r), by taking 
a sufficiently large (cf. Equation (Tl6j) ). Thus, P(B(r) > 5) — > as r — > oo. 
This concludes the proof. □ 

5. ex-Fair Policies: A Useful Drift Inequality. We now shift our 
focus to the steady-state regime. The key to many of our results is a drift 
inequality that holds for every a > and every p > with Ap < C. In this 
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section, we shall state and prove this inequality. It will be used in Section [6] 
to prove Theorem 13.21 

We define the Lyapunov function that we will employ. For a > 1, it will 
be simply the weighted (a + l)-norm L a (n) = a+ ^/ (a + l)F a (n) of a vector 

n, where F a was defined in Q. However, when a £ (0, 1), this function 

HI 

has unbounded second derivatives as we approach the boundary of M + . For 
this reason, our Lyapunov function will be a suitably smoothed version of 



L + to be h a (r) = r a , when a > 1, 



Definition 5.1. Define h a : R + - 
and 

r a , if r > 1, 

(a — l)r 3 + (1 — a)r 2 + r, if r < 1, 



h a (r) 



when a € (0,1). Let H a 



be the antiderivative of h a , so that 



H a (f) = J r /i a (s)ds. The Lyapunov function L c 
be 



is defined to 



L a (n) 



i 

o+l 



Wi = Ki/J,f l v i a for each i Gl, 



For notational convenience, define 
(17) 

so that more compactly, we have 

F a (n) = — Vfflinf 1 " 1 , and L a (n) 
a + 1 ^ 



(a + l)y^Wjg a ( 



71: 



l/(a+l) 



We will make heavy use of various properties of the functions h a , H a , and 
L a , which we summarize in the following lemma. The proof is elementary 
and is omitted. 



Lemma 5.2. Let a € (0, 1). The function h a has the following properties: 

(i) it is continuously differentiable with h a (0) = 0, h a (l) = 1, h' a (0) = 1, 
and h' a (l) = a; 

(ii) it is increasing and, in particular, h a {r) > for all r > 0; 
(Hi) we have r a — 1 < h a {r) < r a + 1, for all r G [0, 1]; 

(iv) h' a (r) < 2, for all r > 0. 

Furthermore, from ( Hi ), we also have the following property of H a : 
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(Hi') r a+1 - 2 < (a + l)-ff a (r) < r a+1 + 2 for all r > 0. 

We are now ready to state the drift inequality. Here we consider the 
uniformized chain I N(r) ) associated with N(-), and the corresponding 

drift. 

Theorem 5.3. Consider a bandwidth- sharing network operating under 
an a-fair policy with a > 0, and assume that Ap < C. Let e be the gap. 
Then, there exists a positive constant B and a positive load- dependent con- 
stant K, such that i/L a (N(r)) > B, then 

(18) E[L Q (N(r + 1)) - L a (N(r)) | N(r)] < -sK. 

Furthermore, B takes the form K' / e when a > 1, and K' / min{e 1//a ,e} when 
a € (0, 1), with K' being a positive load- dependent constant. 

As there is a marked difference between the form of L a for the two cases 
a > 1 and a G (0, 1), the proof of the drift inequality is split into two parts. 
We first prove the drift inequality when a > 1, in which case L a takes a 
nicer form, and we can apply results on F a from previous sections. The 
proof for the case a € (0, 1) is similar but more tedious. We note that such 
a qualitative difference between the two cases, a < 1 and a > 1, has also 
been observed in other works, such as, for example, |20j . 

We wish to draw attention here to the main difference from related drift 
inequalities in the literature. The usual proof of stability involves the Lya- 
punov function ([9]); for instance, for the a-fair policy with a = 1 (the pro- 
portionally fair policy), it involves a weighted quadratic Lyapunov function. 
In contrast, we use L a , a weighted norm function (or its smoothed version), 
which scales linearly along radial directions. In this sense, our approach is 
similar in spirit to [2], which employed piecewise linear Lyapunov functions 
to derive drift inequalities and then moment and tail bounds. The use of 
normed Lyapunov functions to establish stability and performance bounds 
has also been considered in other works; see, for example, [23] and [7]. 

5.1. Proof of Theorem \5.3i a > 1. We wish to decompose the drift term 
in (|18p into the sum of a first-order term and a second-order term, and we 
accomplish this by using the second-order mean value theorem (cf. Proposi- 
tion [23]). Throughout this proof, we drop the subscript a from L a and F a , 
and write L and F, respectively. 

Consider the function L(n) = {Yyi£i w i n ? +1 )~ = [{a + l)F(n)]^ . 
The first derivative of L with respect to n is VL(n) = Vi ? (n)/L a (n) by the 
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chain rule and the definition of L. The second derivative is 

V 2 F(n) VF(n)VL Q (n) T 



V 2 L(n) 



L a (n) L 2 «(n) 

V 2 F(n) VF(n)VF(n) 
a- 



L«(n) L 2 °+l(n) ' 

by the quotient rule and the chain rule. 

Write n for N(r) and n + m for N(r + 1), so that m = N(r + 1) - N(r). 
By Proposition \2A\ for some 6 [0, 1], we have 

(19) L(n + m)-L(n) = m T VL(n) + ^m T V 2 L(n + 6>m)m 

. , m T VF(n) 1 m T V 2 F(n + 6»m)m 

( 2 °) = F777 +o 



(21) 



L a (n) 2 L Q (n + 6»m) 
a m T VF(n + 6>m)VF(n + 6>m) T m 

2 i2a+l( n + ( 9 m ) 



, . m T VF(n) 1 T V 2 F(n + (9m) 

< 22 » 5 -L^F + 2 m i>(n + flm) m - 

since the term m T Vi ? (n + 9m)V F(n + #m) T m is nonnegative. We now 
consider the two terms in f)22f) separately. Recall from the proof of Lemma 
EES that 

E KVF(n) | n] = W 1 ^^" < - £ ™">. 
But (VF(n),i/) = Y^iez w i u i n i' so 

(23) E [m T VF(n) | n] < _ £ £^^< , 
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and so 



E 



m T VF(n) 
L a (n) 



n 



< -£- 



= (E i6 z^< +1 )* 



< -e- 



a+l 



a + l 



-e 



i 



1 a-l 1 



(24) 
where 



-eK, 



(25) 



-K" = K(a, k, /x, i/) 



g-l 1 
a + l ,,a+l 



is a positive load-dependent constant. The second inequality follows from 
the fact that for any vector x, and for any a > 0, ||x|| a +i < ||x|| Q . The 
second to last equality follows from the definition of the Wi (cf. Equation 

(HZD)- 

For the second term in (|22|) . we wish to show that if L(n) is sufficiently 
large, then 

1 r V 2 F(n + 6>m) e 



-in 



-m < -K. 



Note that with probability 1, either m = or m = ±e, for some i € X. 
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Thus 



1 T V 2 F(n + 0m) 1 max !gI [V 2 F(n + 0m)l 

-m — — ; — — m < 



2 L Q (n + 0m) ~ 2 L a (n + 0m 

a maxjgi Wi{rii + 0m 



a-l 



< 



2 [Ei62T u; <( n * + 0m *) a+1 ] Q+1 

a maxjgj Wi(rii + 9mi) a ~ 1 
a 1 



< 2<o +1 K+^o) _1 

< — max w° +1 (n io + 9m io ) , 

where %o € X is such that Wi (rii + 0mj o ) Q_1 = max, g x toj (nj + 9mi) a ~ l . 
Now note that 

— maxu) Q+1 (n, n + 0m, n ) < -K 
(where -fT is defined in (|25p ) if and only if 

i 

amaxiaw^ 1 1 

n io + 0m io > — - , 

A e 

which holds if L(n) > A^'/e for some appropriately defined load-dependent 
constant K' . Thus, if L(n) > K'/e, then 

, s 1 r V 2 F(n + 0m) e 

(26 -m r / „ / m < -K. 

1 ; 2 Z/*(n + 0m) ~ 2 

By adding (|24p and (|26p . we conclude that 

E [L(n + m) - L(n) | n] < ~K, 

when L(n) > K'/e. □ 

5.2. Proof of Theorem \5.3t a 6 (0, 1). The proof in this section is similar 
to that for the case a > 1. We invoke Proposition 12 . 41 to write the drift term 
as a sum of terms, which we bound separately. As in the previous section, 
we drop the subscript a from L a , F a , H a , and h a , and write instead L, F, 
H, and h, respectively. Note that to use Proposition 12.41 we need L to be 
twice continuously differentiable. Indeed, by Lemma l5T2l (i). h is continuously 
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differentiable, so its antiderivative H is twice continuously differentiable, 
and so is L. Thus, by the second order mean value theorem, we obtain an 
equation similar to Equation (1221) : 



(27) L(n + m) - L(n) = m T VL(n) + -m T V 2 L(n + m) m 
^ < Ejgj miWihjni) + 1 _E ie x m$Wih'(ni + flm*) 



L a (n) 2 L Q (n + 0m) 



L Q (n) 2 L a (n + flm) 

for some constant E [0, 1], and where, as before, N(r) = n and N(r + 1) = 
n + m, and the last inequality follows from the fact that with probability 1, 
either m = 0, or m = ±e,;, for some and that h' is nonnegative. 

We now bound the two terms in (|29p separately. Let us first concentrate 
on the term 

Y^ ie i mjWih(ni) 
L a (n) ' 

By Lemma 15.21 (iii), 

miWih(rii) < ^2 miWi(nf + 1) < ^ m^rcf + ^ rmwi, 
iei tei iei iei 

so 

Y^ ieI miWih(rii) < Yjj&i mwinf Ysi&i m i w i 



L a (n) - L a (n) L a (n) 

First consider the term ^' 6 /™^'' n ' ■ Note that Y2iei m i w i n ? = m T VF(n). 
We also recall from the proof of Lemma 14.41 that 

E KVF(n) | n] = < W '°»- "~ " A '"» < - e < VF <^"'. 

We then proceed along the same lines as in the case a > 1, and obtain that 
if L(n) > K 2 /e for some positive load-dependent constant K 2 , then 



E 



L a fn) 



n 



1 

3 maxjgi z/. 



1 q-l 1 

3 max^ T At^/x. a+1 ^ +1 



(30) = —eX, 
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Here as in the proof for the case a > 1, K = K(a, k, ll, v) = max ' eI - 

is a positive load-dependent constant. 

Now consider the term ~~^^y~^ ■ With probability 1, either m = or 
m = ±ej for some i El, and therefore ^2i € x m i w i — maxjgx^i- Thus, 



E 



n 



3 _ maxjpr tfj 
< — eK + 



4 L Q (n) 



For the second term in (|29p . note that with a £ (0,1), Lemma I5.2f iv) 
implies that h! < 2, and therefore, 

1 max ie x Wih'(ni + #mj) max igI 



2 L Q (n + 6»m) ~L a (n + #m) 

Note that L a (n + 0m) and L a (n) differ only by a load-dependent constant, 
since with probability 1, either m = or m = ±ej for some i £ I. Thus, if 
L a (n) > K^/e for some positive load-dependent constant K$, then 

max^gx yjj max igX ^ < 1 
1 J L Q (n) L a (n + (9m) ~ 4 6 



Putting dSOl) and (J3TJ) together, we get that if L(n) > iT/ nim-je 1 /", e}, 

L 3 



where -ftT' = max-fi^g^, K2}, then 



E[L(n + m) --L(n) | n] < ~K. □ 



6. Exponential Tail Bound under a-Fair Policies. In this section, 
we derive an exponential upper bound on the tail probability of the station- 
ary distribution of the flow sizes, under an a-fair policy with a > 0. We will 
use the following theorem, a modification of Theorem 1 from [2]. 

Theorem 6.1. Let X(-) be an irreducible and aperiodic discrete-time 
Markov chain with a countable state space S£ . Suppose that there exists a 
Lyapunov function $ : 3C — > M+ with the following properties: 

(a) <3? has bounded increments: there exists £ > such that for all r, 
we have 

|$(X(t + 1)) - $(X(t))| < f , almost surely ; 
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(b) Negative drift: there exist B > and 7 > such that whenever 



Then, a stationary probability distribution ir exists, and we have an expo- 
nential upper bound on the tail probability of $ under it: for any £ G Z+, 



In particular, in steady state, all moments of <I> are finite, i.e., for every 



Theorem 16.11 is identical to Theorem 1 in [2] except that [2] imposed 
the additional condition E 7r [$(X)] < 00. However, the latter condition is 
redundant. Indeed, using Foster-Lyapunov criteria (see [8], for example), 
conditions (a) and (b) in Theorem 16.11 imply that the Markov chain X has 
a unique stationary distribution ir. Furthermore, Theorem 2.3 in [12] estab- 
lishes that under conditions (a) and (b), all moments of <J?(X) are finite in 
steady state. We note that Theorem 2.3 in |12| and Theorem 1 of [2] provide 
the same qualitative information (exponential tail bounds for $(X)). How- 
ever, [2] contains the more precise bound (|32j) . which we will use to prove 
Theorem 17.61 in Section [71 

Proof of Theorem 13. 21 The finiteness of the moments follows imme- 
diately from the bound in (I32p . so we only prove the exponential bound (I32p . 
We apply Theorem 16.11 to the Lyapunov function L a and the uniformized 
chain N(-). Again, denote the stationary distribution of N(-) by 7r, and 
note that this is also the unique stationary distribution of N(-). The proof 
consists of verifying conditions (a) and (b). 

(a) Bounded Increments. We wish to show that with probability 1, there 
exists £ such that 



As usual, write n = N(t) and n + m = N(r + 1), then m = or m = ±ej 
for some i € I with probability 1. For a > 1, 



$(X(r)) > B 



E[$(X(r + 1)) - $(X(r)) I X(r)] < -7. 



(32) 




k G N 



E^[$ fc (X)] < oo. 



L a (N(r + l))-L a (N(r))| <£. 



a + l 



L a (n) = y^Wjnl 



a+l 
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and for a £ (0, 1), by Lemma 1531 (iii'), we have 



Wi. 



i£l «SX i&X 

In general, for r, s > and f3 € [0, 1], 

(33) {r + sf < rP + s 13 . 

Thus, by inequality (|3"3"|). 



E 

.iei 



w i n i T 



1 

a+l 



2^m; 



1 

a + l 



< L a (n) < 



.iex 



+i 



i 

a + l 



+ 



i 

a + l 



Hence, for any a > 0, 
|L a (n + m) - L a (n)\ < 



a + l 




a + l 















2 E 



< 



E 

.iex 



i 

a + l 



I a+l 



i 

a + l 



< maxu)° +1 + 2 



+ 2 



2 E 



XV, 



■iex 



i 

a + l 



1 

a + l 



where the second last inequality follows from the triangle inequality. Thus 
we can take £ = max ieI w* +1 + 2 [2 £ 

i £ x <u; «J 01+1 ) which is a load-dependent 

constant. 

^ Negative Drift. The negative drift condition is established in Theorem 
15.31 with 7 = eK, for some positive load-dependent constant K. 

Note that we have verified conditions (a) and (b) for the Lyapunov func- 
tion L a . To show the actual exponential probability tail bound for HNH^, 
note that L a (N) > i^'^NHoo, for some load-dependent constant K" . By 
suitably redefining the constants B, £, and K, the same form of exponential 
probability tail bound is established for HNHoo . □ 
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7. An Important Application: Interchange of Limits (a = 1). 

In this section, we assume throughout that a = 1 (the proportionally- fair 
policy), and establish the validity of the heavy-traffic approximation for 
networks in steady state. We first provide the necessary preliminaries to state 
our main theorem, Theorem l7.61 In Section \7.2\ we state and prove Theorem 
17,61 which is a consequence of Lemmas 17.71 and 17.81 Further definitions and 
background are provided in Section 17.31 along with the proofs of Lemmas 
17.71 and 17.81 All definitions and background stated in this section are taken 
from HH and [13]. 



7.1. Preliminaries. We give a preview of the preliminaries that we will 
introduce before stating Theorem 17.61 The goal of this subsection is to pro- 
vide just enough background to be able to state Theorem 17.51 the diffusion 
approximation result from [13] . To do this, we need a precise description of 
the process obtained in the limit, under the diffusion scaling. This limiting 
process is a diffusion process, called Semimartingale Reflecting Brownian 
Motion (SRBM) (Definition 17. 3p . with support on a polyhedral cone. This 
polyhedral cone is defined through the concept of an invariant manifold 
(Definition [72]). 

As in Section T4.3( we consider a sequence of networks indexed by r, where 
r is to be thought of as increasing to infinity along a sequence. The incidence 
matrix A, the capacity vector C, and the weight vector k do not vary with 
r. Recall the heavy-traffic condition — Assumption 14.71 and the definitions 
of the workload w, the workload process W, and the lifting map A from 
Definition 14.81 We carry the notation from Section 14.31 so that 6 > 0, and 
v T — > v > 0, p r — »■ fi > and r(C — Ap r ) — > as r — »■ oo. Recall that 
Ap = C. Let N r and W r be as in (fl5t 

The continuity of the lifting map A will be useful in the sequel. 



Proposition 7.1 (Proposition 4.1 in [13]). The function A : 
W, is continuous. Furthermore, for each w € M + and c > 0, 

(34) A(cw) = cA(w). 



Definition 7.2 (Invariant manifold). A state n G is called invari- 
ant if n = A(w), where w = AM" 1 n is the workload, and A the lifting 
map defined in Definition \4-8[ The set of all invariant states is called the 
invariant manifold, and we denote it by ' . We also define the workload 
cone ■W byW = AM" 1 !, where M = diag( p) is as defined in Definition 
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The invariant manifold jtft is a polyhedral cone and admits an explicit 
characterization: we can write it as 

Jt = jn G Rj 1 : rii = A ^ for all i G 1, for some q G R!f 1 j . 

Denote the j-th face of jtft by which can be written as 
^ 4 in G R? 1 : n, = Msl^ll for all { € T 

y ^ 

for some q € rIjT' satisfying gj = 0^. 
Similarly, denote the j-th face of W by which can be written as 

Semi-martingale Reflecting Brownian Motion (SRBM). 

Definition 7.3. Define the covariance matrix 

T = 2AM~ 1 diag(u)M- 1 A T . 

An SRBM that lives in the cone W , has direction of reflection ej (the jth 
unit vector) on the boundary W 3 for each j G J , has drift 9 and covari- 
ance T, and has initial distribution rj on W is an adapted, \J\- dimensional 
process W(-) defined on some filtered probability space (fl, {^t}, IP) such 
that: 

(i) P-a.s., W(t) = W(0) + X(t) + U(t) for all t > 0; 

(ii) P-a.s., W(-) has continuous sample paths, W(t) G #^ /or all t > 0, 
and W(0) has initial distribution rj ; 

(Hi) under P, X(-) is a |J7"|- dimensional Brownian motion starting at the 

origin with drift 9 and covariance matrix T; 
(iv) for each j G J , Uj(-) is an adapted, one- dimensional process such that 

P-a.s., 

(a) Uj(0) = 0; 

(b) Uj is continuous and non- decreasing; 

(c) Uj(t) = J tl[ {w( s ) 6 ^}4( s ) f° r al1 1 ^ °- 

The process W(-) is called an SRBM with the data (W,9,T,{ej : j G 
J},Vo)- 
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Diffusion Approximation for a = 1. 

Assumption 7.4 (Local traffic). For each j € J , there exists at least 
one i € X such that Aji > and = for all k ^ j. 

Under the local traffic condition, a diffusion approximation holds. 

Theorem 7.5 (Theorem 5.2 in [13]). Assume that a = 1 and that the 
local traffic condition, Assumption \7.4\ holds. Suppose that the limit distri- 
bution of W r (0) as r — > oo is r/ (a probability measure onW) and that 

(35) ||N r (0) - A(W r (0))||oo 0, in probability, as r -> oo. 

Then, the distribution of (W r (-), N r (-)) converges weakly (on compact time 
intervals) as r — > oo to a continuous process (W(-), N(-)), where W(-) is an 
SRBM with data (W,6,T, {ej,j G J},Vo) and N(t) = A(W(i)) for all t. 

7.2. Interchange of Limits. We now know that for a = 1, under the 
local traffic condition, the diffusion approximation holds. That is, the scaled 
process (W r (-), N r (-)) converges in distribution to (W(-),N(-)), with W(-) 
being an SRBM. For any r, the scaled processes N r (-) also have stationary 
distributions 7v r , since they are all positive recurrent. These results can be 
summarized in the diagram that follows. 



NT)| 



[°> T 1 Theorem [73] 



I [0,71 



OO 



oo 



oo 



TV 



••- 7T 



As can be seen from the diagram, two natural questions to ask are: 

1. Does the diffusion process N(-) have a stationary probability distribu- 
tion, 7T? 

2. If tt exists and is unique, do the distributions tt v converge to 7r? 

Our contribution here is a positive answer to question 2. More specifically, 
if N(-) has a unique stationary probability distribution tt, then ir r converges 
in distribution to tt. 
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Theorem 7.6. Suppose that a = 1 and that the local traffic condition, 
Assumption \ 7.4\ holds. Suppose further that N(-) has a unique stationary 
probability distribution tv. For each r, let 7v r be the unique stationary prob- 
ability distribution q/N r . Then, 

7r r — > 7r, in distribution, as r — > oo. 

The line of proof of Theorem 17.61 is fairly standard. We first establish 
tightness of the set of distributions {7r r } in Lemma 17.71 Letting the pro- 
cesses N r (-) be initially distributed as {•7r r }, we translate this tightness con- 
dition into an initial condition similar to ()35|) . in Lemma [7.81 We then apply 
Theorem [73] to deduce the convergence of the processes N r (-), which by sta- 
tionarity, leads to the convergence of the distributions 7T r . We state Lemmas 
17.71 and 17.81 below, and defer their proofs to the next section. 

Lemma 7.7. Suppose that a = 1. The set of probability distributions 
{7r r } is tight. 

Lemma 7.8. Consider the stationary probability distributions ir r o/N r (-), 
and let {7r rfe } be any convergent subsequence of {7r r }. Let N r (0) be dis- 
tributed as 7r r for each r. Then there exists a subsequence ri of r^ such 
that 



(36) N r ^(0)-A (W r '(0) 







in probability as £ — >• oo, i.e., such that condition $35)) holds for the subse- 
quence {(W r *(-),N r « (•))}. 



Proof of Theorem \ 7. 6\ Since {7r r } is tight by Lemma [7. 7\ Prohorov's theo- 
rem implies that {7r r } is relatively compact in the weak topology. Let {7r rfe } 
be a convergent subsequence of the set of probability distributions {7r r }, 
and suppose that 7r rfe — > 7r as k — > oo, in distribution. 

Let N r (0) be distributed as 7r r for each r. Then by Lemma 17.81 there 
exists a subsequence r£ of such that 

N r< (0) — A (w n (0) 



-> 

in probability as £ — > oo. Denote the distribution of W r (0) by rf . Since 
7r rk — > 7r as k — > oo, 7v re — > 7r as £ — > oo as well, and rf 1 —¥ t] as £ — > oo, for 
some probability distribution r]. 

We now wish to apply Theorem 17.51 to the sequence {N r<? (-)}. The only 
condition that needs to be verified is that r] has support on W. This can be 
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argued as follows. Let N(0) have distribution it, and let W(0) = AiW _1 N(0) 
be the corresponding workload. Then W Tf (0) — > W(0) in distribution as 
r — > oo, and W(0) has distribution rj. The lifting map A is continuous by 
Proposition 17. 1\ so A ^W r *(0)^ — > A ^W(O)^ in distribution as r — > oo. 

This convergence, together with (|36p and the fact that N r£ (0) — > N(0) in 
distribution, implies that N(0) and A ^W(O)^ are identically distributed. 

Now A ^W(O)^ has support on jtft , so N(0) is supported on ^# as well, and 

so W(0), hence 77, is supported on W . 

By Theorem[73J (W r£ (-), N r * (•)) converges in distribution to a continuous 
process (W(-),N(-)). Suppose that W(-) and N(-) have unique stationary 
distributions r) and 7r, respectively. The processes (W re (-), N r ^(-)) are sta- 
tionary, so (W(-),N(-)) is stationary as well. Therefore, W(0) and N(0) are 
distributed as f] and tt, respectively. Since (W re (0), N r « (0)) (W(0), N(0)) 
in distribution, we have that rf e — > f] and ir re — > tv weakly as I —> 00. This 
shows that 7V = tt and rj = rj. Since {7r rfc } is an arbitrary convergent sub- 
sequence, tt is the unique weak limit point of {7r r }, and this shows that 
TT r — > tt in distribution. □ 

For Theorem 17.61 to apply, we need to verify that N(-) (or equivalently, 
W(-)) has a unique stationary distribution. The following theorem states 
that when Ki = 1 for all i £ I, this condition holds; more specifically, the 
SRBM W(-) has a unique stationary distribution, which turns out to have 
a product form. 

Theorem 7.9 (Theorem 5.3 in |13j). Suppose that a = 1 and Ki = 1 
for all i G Z. Let f] be the measure on W that is absolutely continuous with 
respect to Lebesgue measure with density given by 

(37) p(w) = exp((v, w», w £ W, 
where 

(38) v = 2r~ 1 0. 

The product measure f) is an invariant measure for the SRBM with state 
space W, directions of reflection {ej,j £ J}, drift 0, and covariance matrix 
r . After normalization, it defines the unique stationary distribution for the 
SRBM. 



By Theorems 17.61 and 17.91 the following corollary is immediate. 
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Corollary 7.10. Suppose that a = 1 and k« = 1 for all i El. Suppose 
further that the local traffic condition, Assumption holds. Let tt be the 
unique stationary probability distribution o/N(-). For each r, let 7r r be the 
unique stationary probability distribution o/N r . Then, 

7r r — > 7r, in distribution, as r — > oo. 



7.3. Proof of Lemmas 7.7 and 7., 



Proof of Lemma \ 7. 7[ To establish tightness, it suffices to show that for 

Ixl 

every y > there exists a compact set C M + such that 

(39) limsu P 7r r (mJ '\K y ) < e" 2 '. 

r— too * ' 

We now proceed to define the compact sets K^. As in the proof of Theorem 
14.101 let e r = e(p r ) be the gap in the rth system. Then, under Assumption 
14.71 f° r sufficiently large r, e r > D/r for some network-dependent constant 
D > 0. Since a = 1, Theorem 13 . 2 1 implies that for the rth system, there exist 
load-dependent constants K r > and £ r > such that for every £ € Z+, 

(40) fllNloc > — + 2£ P *) < 1 



By the definition of a positive load-dependent constant, there exist contin- 
uous functions f\ and ji on the open positive orthant such that for all r, 
K r = fi(n r ,u r ) and £ r = /2(/x r ,i/ r ). Since /x r — > /i. > and i/ r — > i/ > 0, 
we have K r K = i/) > and £ r — >• £ = /2(/x, f ) > 0. Define 



A 



>ixi , ii ii ^ (^+i)+4(e+i)^ 



^ v e M£' : HvHoo < - — — 

We now show that (|39p holds, or equivalently, by the definition of ~K y , we 
show that for every y > 0, 

(4!) limsup^ ( Vlloo > (* + !>+ + < 

r->oo \ r D J 

Let ^ r = |_2£ r y/£rJ> where for z € R, [zj is the largest integer not exceeding 
z. By PU|) . we have 
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Taking logarithms on both sides, we have 

logP^r f-HNloo > ^ + < ~{lr + 1) log (l + E f 

\r re r r J \ sr 

Since e r — >• and £ r — > £ > as r — > oo, ^ < 1 for sufficiently large r. Since 
log(l + t) > t/2 for t £ [0, 1], we have 

-(^ + l)log + <-(^ + l)|-, 

when r is sufficiently large. By definition, t r = [2^ r y/e r \, so £ r +l > 2£ r x/e r , 
or equivalently, — (£ r + 1)^- < — y. Thus, when r is sufficiently large, 

iogP„, - N r oo > — + < -y- 

Consider the term + r , When r is sufficiently large, re r > -D, K r < 
K + 1 , and ^ r < £ + 1 , and so 

K r | 2j r i r < if r | 2^(2^) < A- + 1 | 4(g + l) 2 ?/ 
re r r ~~ re r re r ~ D D 

Thus, for sufficiently large r, 

\r D 

< lQgP^r - N r oo > — + < -y- 

\r re r r J 

This establishes f|41 1) . and also the tightness of {^ r }- □ 

Next, we prove Lemma 17.81 To this end, we need some definitions and 
background. In particular, we need the concept and properties of fluid model 
solutions. 

Definition 7.11. A fluid model solution (FMS) is an absolutely con- 
tinuous function n : [0, oo) — s> ' such that at each regular point t > of 
n(-), we have, for each i 6 X, 

( Ar%\ d _ f Vi - mAi(n(t)), ifriiit) > 0, 

[ > dt nt[) ~\0, ifrn(t) = 0, 



1 A point t £ (0, oo) is a regular point of an absolutely continuous function / : [0, oo) — > 

XI 

W + if each component of / is differentiable at t. Since n is absolutely continuous, almost 
every time t £ (0, oo) is a regular point for n. 
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and for each j G J , 

(43) J2 ^iiA<(n(t))+ AjifH^Cj, 

i€X + (n(t)) ieX (n(t)) 

w/iere 2+(n(t)) = {i 6 I : n;(t) > 0} and Z (n(i)) = {iel: nj(i) = 0}. 
Note that here Ap = C. 

We now collect some properties of a FMS. The following proposition states 
that the invariant manifold ^# consists exactly of the stationary points of a 
FMS. 

Proposition 7.12 (Theorem 4.1 in Q2]). A vector no is an invariant 
state, that is, no € jtfi ' , if and only if for every fluid model solution n(-) with 
n(0) = no, we have n(t) = no for all t > 0. 

The following theorem states that starting from any initial condition, a 
FMS will eventually be close to the invariant manifold *M . 

Theorem 7.13 (Theorem 5.2 in [I4]). Fix R € (0, oo) and 5 > 0. T/iere 
is a constant T^s < oo such that for every fluid model solution n(-) satisfying 
||n(0)||oo < R we have 

d(n(t), Jt) < 5, for all t > T RtS , 

where d(n(t),^) = inf ne ^f ||n — n(t)||oo is the distance from n(t) to the 
manifold <M . 

Proposition ^. 14l states that the value of the Lyapunov function F\ defined 
in ([9]) decreases along the path of any FMS. 

Proposition 7.14 (Corollary 6.1 in [H]). At any regular point t of a 
fluid model solution n(-), we have 

!*M*»<o, 

and the inequality is strict if n(t) £ 

Using Proposition 17.141 and the continuity of the lifting map A, we can 
translate Theorem 17.131 into the following version, which will be used to 
prove Lemma 17.81 
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Lemma 7.15. Fix R G (0, oo) and 5 > 0. There is a constant Tr,,5 < oo 
such that for every fluid model solution n(-) satisfying \\ 

n(0)||oo — R have 
||n(t) - A(w(t))||oo < 5, for all t > T R>S , 
where w(i) = w(n(i)) is the workload corresponding to n(i) (see Definition 

Proof. Fix R > and 5 > 0. Let ||n(0)||oo < R. Then, 

F 1 (n(0)) = ^vr 1 Ki n 2 i (0)<R>, 
iei 

where R' depends on R and the system parameters. Since n(-) is absolutely 
continuous, by Proposition 17.141 and the fundamental theorem of calculus, 
we have that i ? i(n(t)) < R' for all t > 0. Define the set 

S = {n£ R+ ' : Fi(n) < R'}, 

and its 5-fattening 

Ss — {n G R+' : ||n — n'|| < 5 for some n' G 5}. 

Note that both 5 and S$ are compact sets, and n(i) £5c5j for all t > 0. 

Now consider the workload w defined in Definition 14.81 Define the set 
w(<Ss) = {v G W, : v = w(n) for some n G 5,5}. Since w is a linear map, 
there exists a load-dependent constant H such that 

||w(n) - w(n')||oo < H\\n - n'Hoo, 

ixi 

for any n, n' G Rj_ . Thus w(«S,s) is also a compact set. Since n(t) G Ss for 
all i > 0, w(i) G w(<S<5) for all t > 0. By Proposition 17. 1| A is a continuous 
map, so A is uniformly continuous when restricted to w(Ss)- Therefore, 
there exists 5' > such that for any w',w G w(S$) with ||w' — w||oo < 5', 
||A(w')-A( w)||oo <C ^. Thus for any n, n' G S$ with ||n — n'Hoo < S'/H, we 
have ||w(n) — w(n')|| < 5', and 

||A(w(n))-A(w(n'))||oo<^. 

Let 5" = min{5/2, S'/H}. By Theorem 17. 131 there exists Tr$>i such that for 
all t > Tjis>', 

d(jt,n(t)) < 5". 
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In particular, there exists n G j$ (which may depend on n(t)) such that 
||n - n(t)||oo < 5" < d'/H. Since n(t) € S and 5" < 5, n £ S$ as well. Thus 

||A(w(n))-A(w(n(i)))||oc<4 
By Proposition 17.12] since n G we have n = A(w(n)), and hence 

||n-A(w(n(t)))|| 00 < -. 

Thus for all t > Trs", 

||n(t)-A(w(t))||oo < lln-n^lloo + Hn- A(w(t))|| 0O 

< 5 + - < - + - = 5. 
2 - 2 2 

Note that 5" depends on R, 6, and the system parameters. Thus, we can 
rewrite Tr^h as Tr g . This concludes the proof of the lemma. □ 

The last property of a FMS that we need is the tightness of the fluid-scaled 
processes N r and W r , defined by 

(44) N r (t) = N r (rt)/r, and W r (t) = W r (rt)/r. 

Theorem 7.16 (Theorem B.l in [H]). Suppose that {N r (0)} converges 

\i\ 

in distribution as r — > oo to a random variable taking values in W + . Then, 
the sequence {N r (-)} is C-tigh^, and any weak limit point N(-) of this 
sequence, almost surely satisfies the fluid model equations and |J5p. 



Proof of Lemma |7.ff[ Consider the unique stationary distributions 7r r of 
N r (-), and rf of W r (-). Let ir rk be a convergent subsequence, and suppose 
that 7r rfc — > 7r in distribution, as k — > oo. Suppose that at time 0, ^-N rfc (0) 

' k 

is distributed as iv rk . Then ^W rfc (0) is distributed as rf k , which converges 
in distribution as well, say to r\. 

We now use the earlier stated FMS properties to prove the lemma. Note 
that for all r, 

^N r (0) = N r (0) = N r (0) and ^W r (0) = W r (0) = W r (0), 



2 Consider the space D' 1 ' of functions / : [0, oo) — > R' 1 ' that are right-continuous on 
[0, oo) and have finite limits from the left on (0, oo). Let this space be endowed with 
the usual Skorohod topology (cf. Section 12 of [3]). The sequence {N r (-)} is tight if the 
probability measures induced on D' z ' are tight. The sequence is C-tight if it is tight and 
any weak limit point is a measure supported on the set of continuous sample paths. 
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and consider the fluid-scaled processes N rfc (-) and W rfc (-). Since |N rfe (0)} 
converges in distribution to 7r, Theorem 17.161 implies that the sequence 
{N rfc (-)} is C-tight, and any weak limit N(-) almost surely satisfies the fluid 
model equations. Let N(-) be a weak limit point of {N r, = (-)}, and suppose 
that the subsequence {N r£ (-)} of {N rfc (-)} converges weakly to N(-). 
Let 5 > 0. We will show that we can find r(5) such that for ri > r(5), 

P(||N^(0)-A(W^(0))|U>J) < S. 

Since N(0) is a well-defined random variable, there exists R$ > such that 

P(||N(0)|| oo >i? 5 ) < 5 -. 

Now, for all sample paths u such that ||N(0)(w)|| oo < R$, and such that 
N(-)(cj) satisfies the fluid model equations, Lemma 17.151 implies that there 
exists T = Tr 5> s such that 

||N(T)(a;)-A(W(r))H|| 00 < 6. 

Since N(-) satisfies the fluid model equations almost surely, we have 

P(||N(r)-A(W(T))|| 00 <5)>l-|. 

Now for each r, N r (0) is distributed according to the stationary distribution 
7r r , so N r (-) is a stationary process. Since N rf (•) — > N(-) weakly as I — >• oo, 
N is also a stationary process. Thus, N(T) and N(0) are both distributed 
according to 7v. This implies that 

P(||N(0)-A(W(0))|| 00 <5)>1-^. 

Furthermore, since N r *(0) — > N(0) in distribution, 

P(||N r <(0) - A(W^(0))||oo < 5) -> P(||N(0) - A(W(0))||oo < 5) 
as £ — > oo. Thus there exists r(5) such that for all ri > r(5), 

P(||N^(0) - A(W^(0))||oo < 5) > 1 - 5. 
Since 5 > is arbitrary, 

||N^(0) - A(W r «(0))|U = ||N r '(0) - A(W^(0))|U 0, 
in probability. □ 
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8. Conclusion. The results in this paper can be viewed from two dif- 
ferent perspectives. On the one hand, they provide much new information 
on the qualitative behavior (e.g., finiteness of the expected number of flows, 
bounds on steady-state tail probabilities and finite-horizon maximum excur- 
sion probabilities, etc.) of the a- fair policies for bandwidth-sharing network 
models. At an abstract level, our results highlight the importance of re- 
lying on a suitable Lyapunov function. Even if a network is shown to be 
stable by using a particular Lyapunov function, different choices and more 
detailed analysis may lead to more powerful bounds. At a more concrete 
level, we presented a generic method for deriving full state space collapse 
from multiplicative state space collapse, and another for deriving steady- 
state exponential tail bounds. The methods and results in this paper can be 
extended to general switched network models. Parallel results for a packet 
level model are detailed in [18] . 
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